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ABSTRACT 

Aims. We intend to compile a new galaxy group and cluster sample of the latest available SDSS data, adding several parameter for 
the purpose of studying the supercluster network, galaxy and group evolution, and their connection to the surrounding environment. 
Methods. We used a modified friends-of-friends (FoF) method with a variable linking length in the transverse and radial directions to 
eliminate selection effects and to find reliably as many groups as possible. Using the galaxies as a basis, we calculated the luminosity 
density field. 

Results. We create a new catalogue of groups and clusters for the SDSS data release 8 sample. We find and add environmental 
parameters to our catalogue, together with other galaxy parameters (e.g., morphology), missing from our previous catalogues. We take 
into account various selection effects caused by a magnitude limited galaxy sample. Our final sample contains 576493 galaxies and 
77858 groups. The group catalogue is available at http : / /www . aai . ee/ ~elmo/dr8groups/ and from the Strasbourg Astronomical 
Data Center (CDS). 
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1. Introduction 

Observations of the local Universe have shown that basically all 
galaxies are located in groups - it is their natural environment. 
Groups and clusters of galaxies form the basic building blocks of 
the Universe. Therefore, it is essential to extract groups of galax- 
ies from galaxy surveys, and their study can provide new under- 
standing of the evolution of galaxies, of the large-scale structure, 
and of the underlying cosmological model. 

In our previous papers (Tago et al. 20d8[ l2010h we have ex- 
tracted groups from the SDSS DR5 and DR7 samples, respec- 
tively. In these papers we have given an extensive review of pa- 
pers dedicated to group search methods and of the published 
group catalogues. In this introduction we present only a short 
review of the studies of galaxy groups. 

During the last decade, several group catalogues that use 
spectroscopic redshifts have been published, either based on 
the 2 dFGRS (Eke et al. 2004; Yang et al. 2005; Tago et al. 
20061) . or on earlier release s of t he SDSS (jE inasto et al. 
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I2OO7I) . However, similar algorithms used to com- 



pile these catalogues have yielded groups of galaxies with rather 
different statistical properties. 

Several authors have recently compiled group catalogues up 
to the redshift . 6: GA MA (Galaxy And Mass Assembly) by 
iRobotham et al.l (|2011|) is a galaxy groiip cata logue based on 
the SDSS target catalogue; iFarrens et al.l (1201 li) derived a cat- 



alogue, based on the LRGs and QSOs in the 2dF-SDSS sur- 
veys. Using photometric redshift s several group/cluster cata- 
logue s have been corn piled (e.g. iGal et al.l l2009t ISzabo et 
I2OI ih . lHao et al.l (l2010h applied a Gaussian mixture BCG algo- 
rithm to the SDSS DR7 data and assembled a photometric group 
catalogue up to the redshift 0.55. Using spectroscopic redshifts, 
iKnobel et al.l ( 12009.) compiled the deepest group catalogue so 
far, reaching up to the redshift 1 in the zCOSMOS field. In 
addition, catalogues of rich galaxy clusters have been created 
by Miller et al. (2005), Aguerri et al. (2007), and Popesso et aT| 
(120071) . We discuss and compare some of these catalogues in a 
separate paper 

The papers dedicated to group and cluster search use a wide 
range of both sample selection methods as well as cluster search 
methods and parameters. The choice of the methods and pa- 
ramet ers depends on the goa l of the catalogue. For example, 
while 'Weinmann et al.l (l2006l) searched for compact groups in 
the SDSS DR2 sample, a pplying strict c riteria in the friend-of- 
friend (FoF) method, then iBerlind et al.l (|2P06) applied the FoF 
method to the volume-limited samples of the SDSS with the goal 
to measure the group multiplicity function and to constrain dark 
matter haloes. Hence, the parameters of the algorithm depend on 
the goal of the study. 

Our goal is to generate an up-to-date catalogue of groups and 
clusters for large-scale structure studies. The catalogue is based 
on the SDSS data release 8 (DR8). Since the SDSS spectroscopic 
main sample basically did not change from DR7 to DR8, we use 
exactly the same group finding algorithm and parameters as de- 
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scribed in lTago et alJ (1201 Ol) . yielding a group sample with sim- 
ilar properties. The photometry of the galaxies in DR8 has been 
reprocessed, yielding more accurate luminosities. This is impor- 
tant for detailed photometric galaxy modelling (Tempel et al. in 
prep). 

Compared to our previous catalogues, we added several ad- 
ditional descriptors, including environmental parameters (both 
local and global) and morphology. For the DR7, such data 
have been already used in several papers: the environment and 
morphology have be en used to study the environmental effects 
on galaxy evolution dTempel et al.ll201 lb : global environments 
have been used to extract s uperclusters from the cosmic net- 
work (iLiivamap et al .1120121) . The present catalogue, based on 
the DR8, has already been used to compare the local and global 
environments of galaxie s (Lietzen et al. in p rep), to study the 
structure of rich groups (lEinasto et al.ll2012f) . and to study the 
photometric structure of galaxies (Tempel et al. in prep). 

The paper is organised as follows. The data used are de- 
scribed in Sect. |2] Section [3] gives a brief overview of the 
group finding algorithm used, together with a short com- 
parison with our DR7 catalogue. In Sect. |4] we describe 
our method to calculate the luminosity density field. In 
Sect. |5] we describe the additional galaxy and group param- 
eters. All the parameters in the resulting catalogue are de- 
scribed in Appendix [A] The catalogue can be downloaded 
from http://www.aai.ee/~elmo/dr8groups/ or from the 
Strasbourg Astronomical Data Center (CDSfl 

Throughout this paper we assume the following cosmology: 
the Hubble constant Hq = 100/zkms^'Mpc"', the matter den- 
sity On, - 0.27 and the dark energy density Oa = 0.73. 

2. SDSS data 

Our p resent catalogue is based on the SDSS DR8 jAihara et al.l 
1201 ih . We used only the main contiguous area of the survey 
(the Legacy Survey). The galaxy data were downloaded from the 
Catalog Archive Server (CAS) of the SDSS. The primary selec- 
tion was based on the specphotoall table in the CAS and we 
used only those objects that were classified as galaxies. Since the 
spectroscopic galaxy sa mple is complete o nly up to the Petrosian 
magnitude nir - 17.77 (Strauss et al. 2002), we select that as the 
lower magnitude limit of our sample. Actually, that limit was ap- 
plied after the Galactic extinction correction was used, yielding 
an uniform extinction-corrected sample. Initially, we set no up- 
per magnitude limit to our catalogue. However, since the SDSS 
sample is incomplete for bright objects due to the saturation of 
CCDs, we used the limit - 12.5 for the luminosity function 
and for the weight factor calculations. The bright limit afffects 
only the nearby regions d < 60/z^'Mpc (see Fig.|2]). 

However, the sample is still affected by fibre collisions - 
the minimum separation between spectroscopic fibres is 55". 
For this reason, about 6 per c ent of galaxies in the SDSS 
are without observed spectra. In iTago et aT] (l2008h we showed 
that it does not generate any appreciable effects, when using 
our group-finding algorithm. In the present paper we track the 
missing galaxies and add a flag to the galaxy/group with a 
neighbour(s) missing from the r edshift catalogu e. According to 
iPatton & Atfi eld (2008) and ElHs on et al.l ( l2008h . 67.5% of close 
pairs with angular separations below 55 "are missing due to fibre 
collision. This reduces the number of galaxy pairs in our group 

' Galaxy and group/cluster tables will be available at the CDS 
via anonymous ftp to cdsarc.u-strasbg.fr (130.79.128.5) or via 
|http://cdsweb.u-strasbg.fr/cgi-bin/qc at?J/A+A/| 



catalogue. Using the missing galaxies, we can estimate the num- 
ber of missing pairs. In the SDSS sample, the absent galaxies are 
more likely to reside in groups and there are only 4% of single 
galaxies which have a missing companion. Furth ermore, only 
60% of them have redshift close to the neighbour's dZehavi et al.l 
i200l . As a result, the estimated amount of missing pairs in our 
catalogue is about 8%. 

The galaxy sample, downloaded from the CAS, needs fur- 
ther checking, since it includes duplicate entries and in some 
cases objects that are not galaxies at all (but still classified as 
galaxies in the CAS). To obtain a clean sample of galaxies, we 
firstly excluded duplicate entries, using the redshifts and angular 
distances between galaxies. We also used the SDSS Visual Tool 
to examine the cases, where duplication was unclear (merging or 
visually extremely close galaxies). We also examined all of the 
1000 brightest galaxies in the remaining sample and excluded 
the entries, which were not galaxies: in most cases these objects 
were oversaturated stars or other artefacts. This step was crucial, 
since for the luminosity density field, the brightest objects that 
are not galaxies can cause the biggest uncertainties in the final 
estimates. Additionally, we visually checked the galaxies, which 
had unphysical colours, and excluded all spurious objects. 

After correcting the redshifts relative to the motion in re- 
spect of the CMB, we put the lower and upper distance limits 
to z = 0.009 and z = 0.2, respectively. The lower limit was set 
to exclude the local supercluster, and the upper limit was cho- 
sen, since at larger distances the sample becomes very diluted. 
As a result, after all the limits and exclusions, our final sample 
includes 576493 galaxies. 

The apparent magnitude m was transformed into the absolute 
magnitude M according to the usual formula 

-25-51og,o(flfL)-/:, (1) 

where di is the luminosity distance in units of /z^'Mpc, 
K is the A; -He-correction, and the index A refers to the 
ugriz filters. The A:-correctio ns were calculated with the 
KCORRECT (v4^) algorithm (iBlanton & RoweisI |2007|) and 
the evolution corrections were estimated, using the luminos- 
ity evolution model of Blanton et alJ (l2003h : Kg = c-z, where 
c = -4.22, -2.04, -1.62, -1.61, -0.76 for the ugriz filters, re- 
spectively. The magnitudes correspond to the rest-frame (at the 
redshift z-0). 

Figure[T]shows the sky distribution of galaxies in the equato- 
rial coordinates for our sample, covering 7221 square degrees in 
the sky ( Martinez et alJl2009ii) . Figure |2] shows the distance ver- 
sus absolute magnitude plot. The flux-limited selection is well 
seen: further away, only the brightest galaxies are observed. 

3. Construction of the group catalogue 

The details of our group finding algorithm are explained in detail 
in Tago et al. (2008, 2010). In this research note we give only a 
brief outline of the method used. 

One of the most conventional methods to search for groups 
of galaxies is clus ter analysis that was introduced in cosmology 
by Turner & Gott (1976). This method was named friends-of- 
friends (FoF) by Press & Davis (1982). With the FoF method, 
galaxies are linked into systems, using a certain linking length 
(or neighbourhood radius). Choosing the right linking length is 
rather complicated. In most cases, the linking length is not con- 
stant, but varies with distance and/or other parameters. 

Our experience shows that the choice of the linking length 
depends on the goals of the specific study. In our group cata- 
logue, our goal is to obtain groups to estimate the luminosity 
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Fig. 1. The SDSS contiguous sample area in the equatorial coor- 
dinates. The sky coverage is 7221 square degrees. 




Distance (h'^Mpc) 

Fig. 2. The distance versus the absolute magnitude of the galax- 
ies. The faint magnitude limit is fluctuating due to the k- 
correction. The bright magnitude limit nir = 12.5 (used for the 
weight factor) affects only the nearby region d < 60/i"'Mpc. 
The solid line shows the weight factor Wd at a given distance 
(see Sect. |4]for more information). 



density field and to study the properties of the galaxy network. 
Hence, our goal is to find as many groups as possible, whereas 
the group properties must not change with distance. In our group 
definition, we tried to avoid the inclusion of large sections of sur- 
rounding filaments or parts of superclusters. 

To find the proper scaling for the linking length with dis- 
tance, we created a test group catalogue, using a constant 
linking length. Then we selected in the nearby volume {d < 
200/i^'Mpc) all groups with more than 20 members. Assuming 
that the group members are all at the mean distance of the group, 
we determined their absolute magnitudes and peculiar radial ve- 
locities. Then we shifted these nearby groups, calculating the pa- 
rameters of the groups (new ^H-e-corrections and apparent mag- 
nitudes), as if the groups were located at larger distances. As 
with the increasing distance more and more fainter members of 
groups fall outside the observational window of apparent mag- 
nitudes, the group membership changes. We then determined 
new properties of the groups - their multiplicities, characteristic 
sizes, rms velocities, and number densities. We also calculated 
the minimum FoF linking length necessary to keep the group to- 
gether at this distance. Determining the mean values of the group 
linking lengths, we found that the linking length in our group 
finding algorithm increases moderately with distance. A good 
approximation of the scaling law for the linking length (dLL) is 
the arctan function 

diLiz) = diLfl [1 + fl arctan(z/z*)] , (2) 



Table 1. The numbers of groups (A^gr) and galaxies (A^gai) in dif- 
ferent group richness (Mich) bins. 



Group richness 




Fraction" 




Fraction'' 


All 


77858 


100.0 


265578 


46.1 


A^rich > 3 


30515 


39.2 


170892 


29.6 


A'rich > 4 


16358 


21.0 


128421 


22.3 


A'rich > 5 


10150 


13.0 


103589 


18.0 


A'rich > 6 


7078 


9.1 


88229 


15.3 


A'rich > 8 


4078 


5.2 


69060 


12.0 


A'rich > 10 


2686 


3.4 


57369 


10.0 


A'rich > 15 


1278 


1.6 


41091 


7.1 


A'rich > 20 


772 


1.0 


32646 


5.6 


A'rich > 30 


412 


0.5 


24169 


4.2 


A'rich > 40 


233 


0.3 


18145 


3.1 


A'.-ch > 50 


147 


0.2 


14349 


2.5 


A'rich > 75 


66 


0.08 


9528 


1.6 


A'.-.ch > 100 


35 


0.04 


6965 


1.2 


Nncb = 2 


47343 


60.8 


94686 


16.4 


3 < A'.-i.h < 4 


20365 


26.2 


67303 


11.7 


5 < A'nch < 9 


7464 


9.6 


46220 


8.0 


10 < A'rich < 29 


2274 


2.9 


33200 


5.6 



Notes. Fraction of groups in per cent. Fraction of galaxies in 
groups (per cent). 



where dn^ is the value of linking length at the initial redshift; a 
and z* are the parameters. For the DR7 groups we find the pa- 
rameter values fl = 1 .00 and z* - 0.050. The ratio of the radial to 
the transversal linking lengths was 10 (if the radial linking length 
in kms"' is transformed into a formal "distance" in /z^'Mpc). 
We used the following initial linking length values: 250 km s"' 
for the radial length and 0.25 /i 'Mpc for the transversal length. 
We use the same values for the DR8 data. Higher initial values 
would lead to including galaxies from neighbouring groups and 
filaments; lower values exclude the fastest members in interme- 
diate richness groups. The selected parameters lead to reason- 
able group properties. 

Our final group catalogues are rather homogeneous. The 
group richnesses, mean sizes and velocity dispersions practically 
do not depend on their distance. The homogeneity of our cat- 
alogues have been tested al so by other authors. For example, 
iTovmassian & PUonisI (l2009h select poor groups from our SDSS 
catalogues and conclude that the main parameters of our groups 
are distance independent and well suited for statistical analysis. 

As a final result, the group catalogue includes 77858 groups 
with two or more members. Table [1] shows the numbers (and 
fractions) of groups and galaxies in different group richness bins. 
Almost half of the galaxies in our sample (46%) belong to a 
group and 10% of galaxies belong to groups with ten or more 
members. Most of our groups (60%) are groups with two (ob- 
servable) members and 2 1 % of groups have four or more mem- 
bers. 

3.1. Comparison witii tiie DR7 group catalogue 

Considering that the SDSS DR7 and DR8 galaxy catalogues are 
different (due to a reprocessing of the photometry of all earlier 
releases and due to different criteria for sample cleaning) it is 
useful to compare the respective group catalogues and to de- 
termine how different are the groups identified as "the same" 
in both catalogues. We have compared the richness of the 101 
richest groups in both catalogues. As shown in Fig. [3] there are 
no systematic trends. A few individual fluctuations can only be 
seen: in one case, a DR7 group is split into two groups in the 
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Richness DR7 



Fig. 3. Comparison of the richness of the 101 richest groups in 
the DR7 and DR8. There are no systematic deviations from the 
lineNrichdrv - Nrichdi-g. 



DR8 and in one case the two DR7 groups are merged into one 
group in the DR8. 

Compared to our previous DR7 catalogue, there are about 
1000 groups less in the present catalogue. The biggest difference 
is in the number galaxy pairs, the present catalogue contains 600 
pairs less than the DR7 catalogue. The number of groups with 
30 and more members is practically the same in both catalogues. 
We also matched the groups in the DR7 and present catalogues, 
using the group richness and the brightest galaxies in groups. We 
were able to match 94% of the groups, where the groups in DR7 
and DR8 have at least 90% common galaxies. The remaining 
6% are mostly smaller groups and/or are spUt into two groups or 
merged into one group. 

Since the main change between the DR7 and DR8 data con- 
cerns the photometry, we evaluated how the group luminosi- 
ties differ when compared to our previous catalogue. The up- 
per panel in Fig. |4] shows the magnitude differences between 
the DR7 and DR8 for galaxies. The reprocessed photometry in- 
creases the brightness of luminous galaxies, mainly due to a bet- 
ter estimate of the sky background. The bottom panel in Fig. |4] 
shows the relative difference in group luminosities for the DR7 
and DR8 data as a function of distance. In this plot, only the 
groups with four and more members and which were identified 
as the same in the DR7 and DR8 samples are shown. We see a 
slight trend with distance: nearby groups in the DR8 are a few 
per cent more luminous and more distant groups less luminous 
than the groups in the DR7. The same trend is also visible for 
galaxies and is caused by the reprocessed photometry. Figure |4] 
shows that for majority of the groups, the difference in the group 
luminosity between the DR7 and DR8 catalogues is less than 5 
per cent. 

We also compared other properties of groups in the DR7 and 
DR8 and our analysis confirms that the properties of the groups 
in both catalogues are very similar. In order not to overcrowd 
the paper with figures, we do not show these comparisons here. 
The basic properties of the groups in the DR7 are described in 
iTago et al. (2010) . for the DR8, the properties are similar. 



4. Estimating the environmental densities 

In this group catalogue, we add environmental densities to the 
galaxies. These densities are important when analysing the in- 
fluence of local and/or global environments on g alaxy evolution. 

We calculate the densities as described by iLiivamagi et al.l 
(l2012h . To calculate the luminosity density field, we need to 
know the expected total luminosities of groups and isolated 
galaxies. The primary factor that determines the calculation of 
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Fig. 4. The upper panel shows the differences between the SDSS 
DR7 and DR8 Petrosian galaxy magnitudes as a function of 
galaxy luminosity. The lower panel shows the relative difference 
between the SDSS DR7 and DR8 group luminosities as a func- 
tion of distance for groups with four and more members, which 
were identified as the same in the DR7 and DR8 samples. Solid 
lines show the 0.1, 0.5, and 0.9 quantiles, respectively. 




-19 -20 -21 
M^ - Slogh (mag) 

Fig. 5. The differential luminosity function in the r-band. The 
dashed line shows the double-power-law fit. The grey area shows 
the 95% confidence limits. The inset panel shows the relative 
difference (n(L)obs/n(i')mod - 1) between the analytical fit and 
the numerical estimate. 

group luminosities is the selection effect, present in a flux- 
limited survey: further away, only the brightest galaxies are seen 
(see Fig. |2l). To take this into account, we calculated for each 
galaxy a distance-dependent weight factor Wd 

J^^ Ln{L)dL 

where L12 = i^^Kfi-'^V^o-Mui) ^j.g jj^g luminosity limits of the 
observational window at the distance d, corresponding to the 
absolute magnitude limits of the window Mi and M2; we took 
Mq - 4.64 mag in the r-band (Blanton & Roweis 2007). To cal- 
culate the magnitudes Mi and M2 we use the average k + e- 
corrections at a given distance. Due to peculiar velocities, the 
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distances of galaxies are somewhat uncertain; if the galaxy be- 
longs to a group, we use the group distance to determine the 
weight factor. In the latter equation, «(L) is taken to be the lu- 
minosity function in the r-band for all galaxies. We used the 
numerical luminosity function in the regions where the lumi- 
nosity function was accurately determined, and used the ana- 
lytical double-power-law approximations only at the bright and 
the faint end. The distance-dependent weight factor is shown in 
Fig. [2] as a solid line. In nearby regions, the weight factor in- 
creases due to the bright limit of the survey (12.5 mag). Further 
away than 400/;"'Mpc, the weight factor increases rapidly due 
to the increasing number of galaxies with apparent luminosities 
lower than the luminosity limit of the survey (17.77 mag). 

To determine the luminosity function, we used the modified 
KnL weighting procedure with kernel smoothing and varying 
kernel widths. The luminosity function n(L) (the number density 
of galaxies) is represented by a sum of kernels centred at the data 
points: 



n(L) = 2 



1 



Vmax(ii) at 



1 (L-Li 
-K 



(4) 



We use the Bji-) spline function (Eq.[8]) with a width a as the ker- 
nel K(-). The kernels are distributions with K(x) > 0, J K(x)dx - 
1, of zero mean. In the latter equation, the kernel widths depend 
on the data, a, = fl(L,); in regions with less data points, we use 
wider kernels. Vmax(L) is the maximum volume where a galaxy 
of a luminosity L can be observed in the present survey, and 
the sum is over all galaxies. This procedure is non-parametric, 
and gives both the form and true normalisati on of the luminosity 
function. We refer to the Te mpel et al.l ( 1201 ll) for a more detailed 
description of the procedure. 

The resulting luminosity function is shown in Fig.|5] together 
with the double-power-law fit. We used the double-power-law in 
the form 



n(L)dL oc (L/L*y [1 + {LILy]~ d(L/L*), 



(5) 



where a is the exponent at low luminosities (L/L*) «: 1, 5 is the 
exponent at high luminosities (L/L*) » 1, y is a parameter that 
determines the speed of transition between the two power laws, 
and L* is the characteristic luminosity of the transition. We find 
the best parameters to be: a = -1.305 + 0.009,5 = -7.13+0.22, 
7 = 1.81 + 0.05, and M* = -21.75 ± 0.05 (corresponds to L*). 

To calculate the expected total luminosities of groups, we 
regard every galaxy as a visible member of a group. For iso- 
lated/single galaxies we made an assumption that only the 
brightest galaxy of the group is vis ible and therefore th e isolated 
galaxy is also part of some group dTempel et alj|200^ . This as- 
sumption is supported by observations of nearby galaxies, which 
indicate that practically all galaxies are located in systems of 
galaxies of various size and richness. 

Assuming that every galaxy also represents a related group 
of galaxies, which may lie outside the observational window of 
the survey, the estimated total luminosity per one visible galaxy 
is 



Ltni — Ln 



(6) 



where Lobs is the observed luminosity of the galaxy. The lu- 
minosity Ltot takes into account the luminosities of unobserved 
galaxies and therefore it can be used to calculate the full lumi- 
nosity density field. 
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Fig. 6. Suppression of finger-of-god redshift distortions for four 
groups. The y-axis shows the distance of a group galaxy in red- 
shift space (grey points) and its corrected distance (black points). 
The jc-axis shows the distance of the galaxy from the group cen- 
tre in the sky in units of /i^'Mpc. 



To determine the luminosity density field, we use a kernel 



gal 



(7) 



where Ltot is the weighted galaxy luminosity, and a - the kernel 
scale. For kernel K(-) we use the B3 spline function: 

\x - 2p - 4|x - Ip + 6\x\' -4\x+l\' + \x + 2p 
B3(x) = — . (8) 

The luminosity density field is calculated on a regular carte- 
sian grid generated by using the SDSS rj and A angular coordi- 
nates. This allows an efficient placement of the field in a cube 
and also a relatively straightforward definition of the sample 
mask. The field mask is designed to follow the edges of the 
galaxy sample in the plane of the sky. The contours of the mask 
are given on Fig. \W\ and we use constant maximum and mini- 
mum distance limits of 55 and 565 /i^'Mpc. 

The estimates of the luminosity density field near the edges 
of the survey are biased since we miss the luminosities of the 
galaxies outside the survey area. For our Bi(-) kernel, the esti- 
mate is not affected, if the distance to the edge is twice as large 
as the smoothing scale. The estimates are fairly reliable also for 
galaxies with the edge distance larger than one smoothing scale. 
To control this effect, we added to the catalogue the distance 
from the edge of the survey for every galaxy. These can be used 
to select galaxies with unbiased estimates of the environmental 
luminosity density. The distance can also be used to determine 
whether a group or cluster is complete or if some of its galaxies 
could lie outside the survey. 

While calculating the density field, we also suppress the 
finger-of-god redshift distortions using the rms sizes of galaxy 
groups in the sky cr^ and their rms radial velocities cr,, (both in 
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physical coordinates at the location of the group). For that, we 
calculate the new radial distances for galaxies c/g^j as 



(9) 



"^gal - -^group + (t/g^i - t/group) ^j'fj^' 

where is the initial distance to the galaxy, and t/group is 

the distance to the group centre. For double galaxies, where the 
extent of the system in the plane of the sky does not have to show 
its real size (because of projection effects), we demand that its 
(co-moving) size along the line-of-sight does not exceed the co- 
moving hnking length diLiz) used to define the system: 



2gal - flgroup + (t^gjjj - "group 



t^eroun + M*„i — t/srounj 
if Iv'i - V2I///0 > didz). 



didz) 



■V2|/i/o 



(10) 



Here z is the mean redshift of the double system. If the velocity 
difference is smaller than that quoted above, we do not change 
galaxy distances. 

The velocity dispersion crj for groups were calculated with 
the standard formula 



1 " 

(1+Z„)2(«-1) 



(11) 



where Vmean and Zm are the mean group velocity and redshift, 
respectively, v, is the velocity of an individual group member, 
and n is the number of galaxies with observed velocities in a 
group. 

In Eq. (|9ll, cr^ defines the extent of the group in the sky, which 
is defined as 



1 



2«(1+Zm)2 



(12) 



where r, is the projected distance in the sky from group centre 
(in co-moving coordinates, in units of /i 'Mpc), and Zm is the 
mean group redshift. 

We note that such a compression will lead to a better esti- 
mate of the density field, but it is unsuitable for a detailed study 
of individual groups and clusters. Figure |6] gives an example of 
suppressing the finger-of-god redshift distortions for four rela- 
tively rich groups. As we see, this procedure makes the galaxy 
distribution in groups approximately spherical, as intended. We 
normalise the density field with respect to the mean luminos- 
ity density. The mean density is calculated as an average over all 
density field vertices (i inside the mask. We find the environmen- 
tal density for all galaxies and groups by linearly interpolating 
the density field values in neighbouring vertices for the location 
of the galaxy or the group. The details of the calculation of th e 
luminosity density field can be found in lLiivamagi et al.l (l2012h . 

5. Galaxy and group parameters added to the 
catalogue 

5.1. Environmental densities 

To estimate the environmental densities, we used the method 
described in Sect. |4] The densities are determined, using the 
SDSS r-band luminosities. Since different smoothing scales rep- 
resent different environments, we calculated the density field 
with various smoothing lengths: 1, 2, 4, 8, and 16/? 'Mpc using 
a 1 /i 'Mpc grid. While the smaller smoothing lengths represent 




1 10 
Normalised density 



Fig. 7. The distribution of the normalised densities for galaxies 
for various smoothing scales: 1, 2, 4, 8, and 16/z^'Mpc. Note 
that the maximum shifts toward lower densities up to the scale 
a - 8/z^'Mpc. 
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Fig. 8. The total luminosity (Z4ot) of groups as a function of the 
normalised density (Di), for the scale a - I /i 'Mpc. The ob- 
served group luminosities are multiplied by the weight factor to 
get the total luminosities. The solid line shows a simple linear 
coiTelation between these two quantities: Ltot = 0.105 ■ D\. 



the group scales, the larger smoothing lengths correspond to the 
large-scale environments, to the supercluster-void network. 

For every galaxy in our sample, we find the density field 
value in the location of the galaxy for all five smoothing scales. 
For groups, we find the mean density for all the galaxies 
in the group. These density field values are included in our 
galaxy/group catalogues. 

Figure |7] shows the distribution of normalised densities for 
galaxies for various smoothing scales: 1, 2, 4, 8, and 16/z^'Mpc. 
The mean density for all smoothing scales is approximately 
0.0165x10"' /iLoMpc^l It is well seen that the maximum shifts 
towards the mean value when moving toward higher smooth- 
ing scales. However, there is no shift after the smoothing scale 
reaches a - 8/i"'Mpc, indicating that a wider smoothing does 
not reveal any new structures in the density field. The smoothing 
scale fl = 8 /i^'Mpc was also used t o find the largest struct ures 
in the cosmic web - superclusters bv lLiivamagi et al] (l2012h . 

Figure |8] shows the normalised density for the smoothing 
scale a - I /i 'Mpc versus the expected total luminosity of 
groups. The expected total luminosity is the product of the ob- 
served luminosity with the weight factor. There is a clear corre- 
lation between this density and the luminosity of groups. 

The environmental densities refer to different structures in 
the cosmic web. The small smoothing scale (a - 1 /i^'Mpc) 
describes the group environment and can be therefore used as 
a local density estimator. The larger smoothing scale (a = 
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Fig. 9. The distribution of the iHuertas-Companv et alJ (1201 ll) 

marks of early- or late-type galaxies for our classified spirals 
(blue solid line) and ellipticals (red dashed line). 



8/!"'Mpc) corresponds to the large-scale environment, to the 
supercluster-void network. Hence, for studying the environmen- 
tal effects for small and large scales, different density estimators 
should be used. 

In our density field estimation, the mean density is distance 
independent, since we use a proper weight factor However, at 
larger distances we see only the brightest galaxies and the miss- 
ing luminosity is added only to these locations. Hence, for these 
distances, the high peaks in the density field are more prominent. 
The effect is stronger for smaller smoothing scales. Our experi- 
ence has shown that for distances smaller than 500/; 'Mpc, the 
densities can be used safely for smoothing scales a = 4/i^'Mpc 
and larger For smaller smoothing scales, the environmental den- 
sities are reliable in the regions, where the weight factor is less 
than two or three. 

5.2. Galaxy morphology 

Galaxy morphology is an important aspect in galaxy evolution 
studies. For the SDSS sample, the galaxy morphologies have 
been estimated by the Galaxy Zoo project, yielding a reliable 
visual classification for the majority of galaxies in SDSS sam- 
ple. However, the visual classification is subjective and needs to 
be tes ted with other methods. 

In lTempel et al.l (l201 ih we carried out a morphological clas- 
sification of the SDSS galaxies, using various galaxy parame- 
ters. This classification takes into account the SDSS model fits, 
apparent ellipticities (and apparent sizes), and different galaxy 
colours. 

Recently, iHuertas-Companv et alj ( 1201 lb published a mor- 
phological classification of galaxies of the SDSS, based on the 
Galaxy Zoo data ( Lintott et al. 2008). They associate with each 
galaxy a probability (mark) of being in the four morphologi- 
cal classes: two early-type classes (E and SO) and two late-type 
classes (Sab and Scd). To compare it with our classification we 
assign the IHuertas-Companv et al.l (1201 lb probability of being 
early- or late-type to our galaxies. In Fig. |9] the distributions 
of these marks are s hown. It is well seen that our cl assification 
agrees well with the iHuertas-Companv et alj (1201 lb classifica- 
tion. 

In our catalogue we give a flag for the galaxy being a spi- 
ral or an elliptical. The flag is gi v en on ly for these galaxies, 
where the iHuertas-Companv et alj (1201 Ih mark is greater than 
0.5. Hence, our classification is rather conservative. In our clas- 
sification about half of the galaxies (45%) are spirals, about one 
quarter (26%) are ellipticals and for 29% of the galaxies, the 
classification is unclear 
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Fig. 10. The distribution of the fibre collision galaxies in the 
SDSS survey. The survey mask used for the density field cal- 
culation is shown by grey lines. 



5.3. Fibre collisions 

In the SDSS, for about 6% of the galaxies the redshifts are not 
measured due to fibre collisions. The minimum separation be- 
tween the galaxies (fibres) is 55". The distribution of missing 
galaxies is not uniform in the SDSS, since in overlapping re- 
gions, the close neighbours are observed. 

To find the galaxies with unmeasured redshifts in our sam- 
ple, we used the SDSS CAS tables sdssTilinglnfo and 
sdssTiledTargetAll. From these tables, we get the list of 
all unmeasured galaxies and the tiling group number The tiling 
group number and galaxy coordinates are used to find the ob- 
served neighbouring galaxies. 

For every unobserved galaxy, we find a closest neighbour in 
the spectroscopic galaxy sample and raise the missing galaxies 
flag (flagfc) for that galaxy. Of course, several galaxies in the 
photometric sample can be close to the same observed galaxy. 
The flag value flagfc gives the number of missing galaxies 
close to it. 

The missing neighbours can be true neighbours of the 
gala xy, but they can also be foreground or background galaxies. 
Zeh avi et alJ (l2002h shows that about 60% of the galaxies have 
a redshift close to the neighbour, observed for the redshift. Our 
visual inspection of such close pairs confirms that about half of 
the missing neighbours seem to be associated with the observed 
counterparts. 

Figure[TO]shows the distribution of fibre colUsion galaxies in 
the SDSS sample. The distribution is quite uniform, except in a 
few regions, where the number of missing galaxies is larger 



6. Conclusions and discussion 

We have constructed a group catalogue for the SDSS DR8 sam- 
ple, following the same procedure that we used for the DR7 sam- 
ple (,Tago et aL 12010 ). Since the spectroscopic data is the same 
in the DR7 and DR8, the new catalogue is similar to the previ- 
ous DR7 group catalogue. The improvements are in the area of 
initial galaxy selections and from the SDSS side, the photometry 
of galaxies. 

In addition to the properties, presented in our previous cata- 
logue, we added some new qualitative information. Most impor- 
tantly, we tracked the missing galaxies in the SDSS (due to fibre 
collisions) and calculated the environmental density parameters 
for each galaxy and group. We also a dded our galaxy morphol- 
ogy as derived in lTempel et alJ (1201 lb . 
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In the study of galaxy groups, the most important problem at 
present is the dynamical status of groups of galaxies. Recently 
[Plionis et al. (2006) and Tovmassian & Plionis (2009) studied 
shapes and virial properties of groups and found a strong de- 
pendence on richness and concluded that groups are not in dy- 
namical equilibrium but rather are at various stages of their viri- 
alisation process. 

The dynamical status of groups is also characterised by 
the exis tence of subgroups and substructures s een in many 
studie s dBurgett et al.ri2004 ICoziol et all 120091: lEinasto et all 
I2OIOI) . These studies also support the non-virialised nature of 
groups of ga laxies. With both observational and simulated data 
iNiemi et"an (12007 ) showed that about 20% of nearby groups are 
not bound and are groups merely in a visual sense. 

The controversial results obtained by various authors may 
be an indication that our present knowledge of groups of galax- 
ies is poor An optimistic viewpoint is that the study of a broad 
and inhomogeneous class of galaxy systems - groups of galax- 
ies - will help step by step to solve important problems of galaxy 
formation and evolution, and of the evolution of the large scale 
structure. 
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Appendix A: Description of the catalogue 

The catalogue of groups and clusters of galaxies consists of two 
tables. The first table lists the galaxies that we used to generate 
our catalogue of groups and clusters, and the second one de- 
scribes the group properties. Both these catalogues include all 
the basic entities (distances, coordinates, luminosities, etc) that 
were present in our previous catalogues, as well the new param- 
eters that are described in this paper 

The catalogues are accessible at 

http://www.aai.ee/~elmo/dr8groups/ with a com- 
plete description in the readme . txt file. We give these 
catalogues as a fits table with two extensions: one for galaxies 
and second one for groups. We will also upload the catalogues 
to the Strasbourg Astronomical Data Center (CDS). 

A.1. Description of tine gaiaxy cataiogue 

The galaxy catalogue contains the following information (the 
column numbers are given in square brackets): 

1 . [ 1 ] id - a unique identification number for galaxies, used by 
us; 

2. [2] idcl - the group/cluster id; 

3. [3] nrich - the richness of the group the galaxy belongs to; 

4. [4]redshift - the redshift, corrected to the CMB rest 
frame; 

5. [5] dist - the co-moving distance in units of Mpc (cal- 
culated directly from the redshift); 

6. [6]distcl - the co-moving distance to the group/cluster 
centre, where the galaxy belongs to, in units of Mpc, 
calculated as an average over all galaxies, belonging to the 
group/cluster; 

7. [7-8] ra , dec - the right ascension and declination (deg); 

8. [9-10] Ion, lat - the galactic longitude and latitude (deg); 
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9. [11-12] eta, lam - the SDSS survey coordinates t] and A 
(deg); 

10. [13-17] mag jc - the Galactic extinction corrected Petrosian 
magnitude (x e ugriz filters); 

11. [18-22] absmagjc - the absolute magnitude of the galaxy, 
A;+e-corrected (x E ugriz filters, in units of mag + 5 logjg h); 

12. [23-27] kcor_x - the /t-correction by the KCORRECT algo- 
rithm (x e ugriz filters); 

13. [28] lumr - the observed lumi nosity in the r-band in un its of 
lQ^°h-^L0, where Mq = 4.64 dBlanton & Roweisll2007h : 

14. [29] w - the weight factor for the galaxy (w-lumr was used to 
calculate the luminosity density field); 

15. [30] rank - the galaxy rank in its group, calculated based 
on the galaxy luminosity: for the most luminous galaxy, the 
rank is 1 ; 

16. [31-35] derifl - the normalised environmental density 
of the galaxy for various smoothing scales (a - 
1 , 2, 4, 8, 16 /i"' Mpc); for galaxies outside our survey mask 
we use the value -999, indicating that the galaxy is outside 
the mask; 

17. [36] edgedist - the co-moving distance of the galaxy from 
the border of the survey mask; 

18. [37] morf - the morphology of the galaxy (0 - unclear, 1 - 
spiral, 2 - elliptical) as described in Sect. | 5.2[ 

19. [38]hcearly - the iHuerta s-Compa nv et al.1 (l201lh mark 
(probability) of being an early type galaxy; 

20. [39] drSob j id - the SDSS DR8 photometric object identifi- 
cation number; 

21. [40] drSspecobjid - the SDSS DR8 spectroscopic object 
identification number; 

22. [41] iddr? - an unique identification number to link with 
the entries in our previous DR7 group catalogue; 

23. [42] zobs - the observed redshift (without the CMB correc- 
tion, as given in the SDSS CAS); 

24. [43] distcor - the co-moving distance of the galaxy when 
the finger-of-god effect is suppressed (as used in luminosity 
density field calculations); 

25. [44] flagfc - if greater than zero, indicates that the galaxy 
has a missing (with unknown redshift) neighbour due to a 
fibre colUsion. 

A.2. Description of tlie group/cluster catalogue 

The catalogue of groups/clusters contains the following infor- 
mation (the column numbers are given in square brackets); 

1 . [1] idcl - the group/cluster id; 

2. [2] nrich - the group richness, number of observed galaxies 
in group; 

3. [3] zcl - the CMB-corrected redshift of group, calculated as 
an average over all galaxies, belonging to the group/cluster; 

4. [4]distcl - the co-moving distance to the group centre 
(/i-'Mpc); 

5. [5-6]racl, deed - the equatorial coordinates, the right 
ascension and declination (deg); 

6. [7-8]loncl, latcl - the galactic longitude and latitude 
(deg); 

7. [9-10] etacl , lamcl - the SDSS survey coordinates t] and 
A (deg); 

8. [11] sizesky - the maximum linear size of the group in the 
sky (in physical coordinates, /i^'Mpc); 

9. [12] rvir - the virial radius in /i 'Mpc (the projected har- 
monic mean, in physical coordinates); 



10. [13] sigma_sky - the rms deviation of the projected distance 
in the sky from the group centre (cr,- in physical coordinates, 
in /i 'Mpc), defines the extent of the group in the sky; 

11. [14] sigma_v- the rms radial velocity deviation (cTv in phys- 
ical coordinates, in km s"'); 

12. [15]lumobs_r - the observed luminosity, the sum of ob- 
served galaxy luminosities (IO'^/j^^Lq); 

13. [16] lumtot_r - the estimated total luminosity of the group 
(IO'^/i^^Lq), the observed luminosity multiplied by a weight 
factor; 

14. [17] dlink- the transverse co-moving linking length used to 
define the group (/; 'Mpc). The ratio between the radial and 
the transversal linking lengths is taken to be 10 (the radial 
linking length in kms ' is transformed into a formal "dis- 
tance" in /z^'Mpc); 

15. [1 8-22] denacl - the environmental density for the group for 
various smoothing scales (a - 1, 2, 4, 8, 16 /j^'Mpc), aver- 
aged over group galaxies inside the survey mask; 

16. [23] edgedistcl - the minimum co-moving distance of 
group galaxies from the edge of the survey mask. Zero, if 
at least one member is outside the survey mask. 

17. [24] f lagf ccl - if greater than zero, indicates that the group 
contains potentially missing (with unknown redshift) galax- 
ies due to the fibre collisions. 
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